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Abstract — Renewable energy sources can be attached to sensor 
nodes to provide energy replenishment for extending the battery 
life and prolonging the overall lifetime of sensor networks. For 
networks with replenishment, conservative energy expenditure 
may lead to missed recharging opportunities due to battery 
capacity limitations, while aggressive usage of energy may result 
in reduced coverage or connectivity for certain time periods. 
Thus, new power allocation schemes need to be designed to 
balance these seemingly contradictory goals, in order to maximize 
sensor network performance. In this paper, we study the problem 
of how to jointly control the data queue and battery buffer to 
maximize the long-term average sensing rate of a wireless sensor 
network with replenishment under certain QoS constraints for 
the data and battery queues. We propose a unified algorithm 
structure and analyze the performance of the algorithm for all 
combinations of finite and infinite data and battery buffer sizes. 
We also provide a distributed version of the algorithm with 
provably efficient performance. 1 

I. Introduction 

Wireless sensor networks have been widely used in mon- 
itoring [1], maintenance [2], and environmental sensing [3]. 
The lack of easy access to a continuous power source and 
the limited lifetime of batteries have hindered the wide-scale 
deployment of such networks. However, new and exciting 
developments in the area of renewable energy [4] [5] provide 
an alternative to a limited power source, and may help alleviate 
some of the deployment challenges. These renewable sources 
of energy could be attached to the nodes and would typically 
provide energy replenishment at a slow rate (compared to the 
rate at which energy is consumed by a continuous stream of 
packet transmissions) that could be variable and dependent on 
the surroundings. 

Energy management in networks equipped with renewable 
sources is substantially and qualitatively different from energy 
management in traditional networks. For example, conserva- 
tive energy expenditure could lead to (i) very long delays 
because the energy is not being fully used to transmit at high 
enough data rates, and (ii) missed recharging opportunities 
because the battery buffer is full. On the other hand, an over 
aggressive use of energy may lead to lack of coverage or 
connectivity for certain time periods. Further, if the battery of a 
node discharges completely, it could be temporarily incapable 
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of transferring time-sensitive data. This may have undesirable 
consequences for many applications. Thus, new techniques and 
protocols must be developed for networks with replenishment 
to balance these seemingly contradictory goals. 

In this paper, we first consider a single link communication, 
in which the transmitting node has a data buffer that holds 
the incoming variable-rate sensing data and a battery buffer, 
which is being replenished at a variable rate. In our model, we 
allow any combination of finite and infinite data and battery 
buffer sizes. If the data buffer size is infinite, the concern is 
the stability of the data queue, while for finite data buffer, 
excessive data losses is the undesirable event. Likewise for 
the battery buffer, frequent occurrences of battery discharge 
should be avoided. We investigate the problem of maximizing 
the long-term average sensing rate, subject to the constraints 
on the stability of the data queue (or the desired data loss ratio 
when the data buffer size is finite) and the desired rate of visits 
to zero battery state. We provide a simple and unified joint rate 
control and power management framework, and show that the 
performance of our scheme is close to optimum. 

We then extend our algorithm to a multihop network setting 
with multiple source destination flows. We generalize our 
joint power and rate rate control algorithms along with a 
multihop routing scheme. We assign each link a weight that 
is a function of the data queue size, the battery state, and the 
chosen power level for that link. Then, we schedule a subset 
of links at a given point in time, depending on their weights 
using maximum weight matching or maximal matching, under 
the primary (or node exclusive) interference model, that has 
been shown to be a good model for Bluetooth of FH-CDMA 
systems with perfect orthogonal spreading codes and low 
power-spectrum density and used widely in wireless network 
control [6] [7]. We extend the performance analysis and 
show that our algorithm achieves provable performance in the 
multihop setting as well. 

While the problem of energy management in sensor net- 
works has seen considerable attention, there have been a 
number of works [8], [9], [10], [11], [12], [13], [14], [15], [16] 
that also address scenarios with energy replenishment. In [8], 
the authors consider the problem of dynamic node activation 
in rechargeable sensor networks. They provide a distributed 
threshold policy on the number of nodes that can be activated, 
which achieves a performance within a certain factor of the 
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optimal solution for a set of sensors whose coverage areas 
overlap completely. In [9], the authors study the problem of 
computing the lexicographically maximum data collection rate 
for each node in a sensor network such that no node will be out 
of energy. In [10], the authors consider the problem of energy- 
aware routing with distributed energy replenishment. They 
provide an algorithm that achieves a logarithmic competitive 
ratio and is asymptotically optimal with respect to the number 
of nodes in the network. In [14], the authors formulate a 
convex joint rate allocation and routing problem in an en- 
ergy replenishing wireless sensor network. They provide an 
implementation of the optimal solution based on a primal-dual 
approach as well as an asymptotically optimal heuristic power 
allocation scheme for finite battery sizes. Power allocation 
for wireless networks without replenishment has been widely 
studied, e.g., [17], [18]. In [18], the authors assume that the 
data buffer is large enough so that packet loss does not occur 
and provide a dynamic programming based solution. In [17], 
the authors develop approximate algorithms to minimize the 
average allocated power, or maximize the throughout given 
average power constraint, and at the same time keep the 
data queue stable. The approximation improves at the cost 
of increasing the data queue length and queueing delay. Most 
of these works assume a constant energy supply, i.e., there is 
no battery issue. The idea of [15] and this paper are motivated 
by [17]. Both papers model energy replenishment as a time- 
varying process and consider jointly managing the data and 
battery buffers. This coupling between the data and battery 
buffers is what makes the problem notoriously difficult to solve 
using standard optimization based approaches. For example, 
unlike [17] that utilizes the fact that a static allocation policy is 
optimal, it is not even clear whether a static policy would even 
be optimal in our setting. Specifically, with a battery buffer, 
there is an additional energy constraint that the allocated 
energy should be within the battery state, and this constraint 
is even more difficult than the average power constraint. In 
[15], the authors consider infinite data buffer and finite battery 
buffer sizes. They assume that the replenishing process is i.i.d 
(for non i.i.d process, they claim that a K slot analysis can be 
applied, which is complex and depends on the network size), 
and show that the probability of battery state being less than 
the peak power or close to the full battery state vanishes as 
the battery size grows, under an index policy. In this paper, 
we construct a framework that accommodates all combinations 
of finite and infinite data and battery buffer sizes by defining 
minimum number of virtual queues in a general format. In 
addition to the constraint on the stability of the data queue 
(constraint on the data loss ratio when the data buffer size is 
finite), we also have a constraint on the frequency of battery 
discharge. Rather than assuming an i.i.d replenishing process 
as in [15], we allow for a general replenishment process 
without assuming ergodicity and carefully design the virtual 
queues and explore the relations between actual queues and 
virtual queues to show that our algorithm is efficient. Further, 
we design a distributed algorithm in detail. 

The main intellectual contributions and challenges of the 



paper are as follows: 

• We formulate a sensing rate maximization problem with 
QoS constraints on both the data and battery queues. Due 
to the coupling between the battery and data buffers, a 
stationary policy could be suboptimal, hence traditional 
resource allocation techniques do not directly apply. 
Nonetheless, we are able to develop a simple and unified 
framework for all combinations of finite and infinite data 
and battery buffer sizes, and our algorithm is provably 
efficient. 

• We extend the algorithm from a single-link case to 
a multi-hop network under node-exclusive interference 
model with practical settings and develop an efficient 
distributed algorithm. 

II. Single Link Model 

We first consider a single link control model in this paper, 
as illustrated in Figure 1. We assume a time slotted system 
and in time slot t, the amount of data available for the sensor 
node to sense is denoted by A(t), which is upper bounded by 
A ma , x , (0 < A max < oo). In the same time slot, the amount of 
data the node actually senses and places in the data buffer is 
R(t). The amount of energy expended by the node at time t for 
data transmission is P(t), which is upper bounded by P pea k, 
(0 < Ppeak < oo) and the achievable data rate at that power 
level is n(P(t)), where we assume /i(-) to be monotonically 
increasing, reversible and differentiable on the half real line 
SR+ 1J{0}. The node has a battery of size Bi, (either Bb < 
oo or Bb = oo) with zero initial battery state. We let r(t) 
denote the amount of replenishment energy arriving to the 
node at time t. The energy state of the battery and the data 
state of the data buffer at time t is given by qb(t) and qd(t), 
respectively. The data buffer has size Bd (again, either Bd < 
oo or Bd = oo). Under this setting, we describe the objective 
in the following section. 
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Fig. 1. Single Link Control Model 



A. Problem Formulation 

Our general objective is to maximize the long-term average 
sensing rate, subject to the QoS constraints on both data and 
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battery queues: 
(A) max lim inf 4 R(i) 

K ' P.R T^oo T ^ w 



T-l 



t=0 



s.t. q d (t + 1) = min 



(q d (t) - /i(P(t))) + + R(t),B d 



q b (t + 1) = min ^(t) - P(i) + r(t), P 6 
0 < i2(t) < 

0 < P(i) < min [g 6 (t),P P eak 

Po < Vo, 

1 T_1 

limsup — Y] gr d (t) < oo (B d = oo), or 
Pd < % (-Bd < oo), 



(1) 
(2) 
(3) 
(4) 
(5) 



(6) 



where (•)+ = max[-, 0], R = {P(0), R(l), R(T-1), . . .} 
is the actual sensing data vector, P = {P(0), P(l), . . . , P(T- 
1), . . .} is the allocated power vector, and 
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Pd 



if liminiT^oo ^ J2t=o R( f ) = 0 
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otherwise 
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p 0 = lim sup i 7 0 (i) 



(7) 
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T^oo 



t=0 



are the long-term data loss ratio with an upper bound r\ d , 
and the frequency of visits to zero battery state with given 
threshold r\ 0 respectively, where 



D(t) = (^(q d (t) - ^(P(t))) + + R(t) - B d ^j , 

^o(^) — -^-battery hits zero state in slot t — 



(9) 



0 if P(t) < q b (t) 

1 otherwise 

(10) 

are the amount of data loss in slot and the indicator that 
the battery discharges completely in time slot t, respectively. 
Some applications may require the battery state to be always 
above certain positive level, then Equation (10) can be easily 
modified to represent how often the battery is below the 
desired level, and our solution structure works as well. Note 
that, we do not assume ergodicity of the system parameters, 
but if they are ergodic, then p a represents the actual probability 
of a complete discharge event as t — > oo. 

In Problem (A), Constraints (1) and (2) describe how the 
data and battery queues evolve, respectively. Especially, if 
B d = oo, (1) can be simplified to 

q d (t+l)= (q d (t) - n(P(t))Y + R(t), (1') 
and if B b = oo, (2) can be simplified to 

q b (t + l) = q b (t)-P(t)+r(t). (2') 

Constraint (3) bounds the actual amount of sensed data R(t) 
by the amount of available data A(t) in slot t. Constraint 



(4) states that we cannot oversubscribe the energy that is 
unavailable in the battery nor can we exceed the peak power 
level. Constraint (5) is the battery QoS constraint r/ 0 of 
the desired battery discharge rate. Constraint (6) is the QoS 
constraint for data queue: if B d = oo, we need to keep the data 
queue stable, and if B d < oo, the data loss ratio is required 
under a given threshold i] d . 

In this paper, our purpose is to develop a simple algorithm, 
which performs arbitrarily close to the optimal performance. 
To achieve that purpose, we do the following: 

• define virtual queues for both the data and battery buffer 
to avoid the difficulties involved in dealing with the data 
loss and battery discharge probability directly. We show 
that keeping the virtual queues stable ensures that the 
constraints on p d and p a are met. 
> design a power allocation scheme based on simple index 
policies and show that our scheme keeps both virtual 
queues stable and at the same time performs arbitrarily 
close to the optimal performance. 
« generalize our algorithm to the multihop scenario and 
develop distributed algorithms. 

B. Virtual Queues 

We define q d and q b as the virtual data and battery queues. 
The virtual queues evolve according to the following Lindley's 
queue evolution equations: 

q d (t + 1) = ((&(t) - VdR(t)) + - »(P(t)) + R(t) + 

(11) 

q b (t + 1) = ((&(*) - + P(t) - r(t) + M{t) + 7 0 (t)) + , 

(12) 

where I(t) — (p(P(t)) — q d (t)) + is the amount of transmitted 
idle packets when there is no enough data to transmit using 
the allocated energy, M(t) = (q b (t) - P(t) + r{t) - B b )+ is 
the amount of missed replenishing energy due to full battery 
when B b < oo, and l a (t) is defined in Equation (10). Note 
that if B b = oo, then M(t) = 0 and Equation (12) reduces to 

&(* + !)= ((q b (t)-Vo) + + P(t)-r(t)+l 0 (t)y. (13) 
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Fig. 2. Battery Queue and Virtual Battery Queue 

Figure 2 shows the relationship between the actual and 
virtual battery queues when the battery size is finite. The 
amount of change in q b {t) from time t to t+1 is P(t) — r(t) + 
M(t). Since the battery has a finite size, this term vanishes 
when averaged over an infinitely long period of time. Then, 
p 0 = \imsup T _ too Y^t=o lo{t) and i] D can be viewed as the 
long-term input and output rate of q b (t), respectively. Thus, it 
is reasonable to expect that q b {t) being stable implies p a < r\ 0 . 
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Without loss of generality, the initial state q b (0) and qd(0) 
can be set to be zero. The following proposition shows that 
if the virtual queues qd(t), q b {t) and the actual battery queue 
qb(t) are all strongly stable, pd and p a are guaranteed to meet 
their constraints. 

Proposition 1: If the virtual queues qd(t), q b (t) and the 
actual battery queue q b (t) are both strongly stable, i.e., 



1 T_1 

limsup - V (q d (t) + q b (t) + q b (t)) 



< OO, 



then p d < r\d and p 0 <r\ 0 . 

The proof of this result can be found in Appendix A. Next, 
we present our scheme. 

III. Joint Rate Control and Power Allocation 
Algorithm 

A. Algorithm 

The algorithm consists of two components: a rate control 
component and a power allocation component. Both compo- 
nents are index policies, i.e., the solutions are memoryless 
and they depend on the instantaneous values of the system 
variables. 

Rate Control (RC): 
We define 0 < V < oo to be the control parameter of our 
algorithm. Let Qd(t) = qd{t) when Bd — oo, and let Qd(t) 

I - rjd)q~d(t) when B d < oo. If Qa{t) < \, the transmitting 
node chooses to sense all the available data, i.e., R(t) = A(t). 
Otherwise, R(t) = 0. 

Power Allocation (PA): 
Solve the following optimization problem 

max Q d (t)n(P(t)) - q b (t)P{t), (14) 
P(t)en(t) 

where U(t) = {P(t) : 0 < P(t) < mm[q b (t), P pea k]} is a 
compact and nonempty set. Allocate P(t). 

The set, LT(t) of possible power allocations guarantees 
Constraint (d) on P(t) in Problem (A). If fi(-) is concave 2 , the 
objective function is a concave function of P(t). Consequently, 
PA solves a simple convex optimization problem in each time 
slot. The positive term Qd(t)fi(P(t)) can be viewed as the 
utility of the allocated power P(t) and the term q b (t)P(t) can 
be viewed as its associated cost. When the control parameter 
V is chosen to be large, Qd(t) tends to be large according to 
RC, and PA tries to allocate higher power P(t) to increase the 
utility, whereas, when the virtual battery queue length q b (i) is 
large, PA avoids allocating a high amount of power to reduce 
the cost. Thus, this index policy of PA can be viewed as a 
greedy profit maximization scheme. 

Observation 1: From the objective function in Equation (14), 
Qd(t)fi(P(t)) can be viewed as the utility. When Qd(t) is 
large, i.e., the data queue or the virtual data queue is large, 
PA tends to allocate a larger power P(t), so that Qd{t) will 

2 For instance, consider the additive white Gaussian noise channel 
capacity, fi(P) = log(l + P/jVn), where No is the two sided noise power 
spectral density. 



decrease. 

Observation 2: Once the battery is discharged, i.e., q b (t) = 0, 
q b (t) tends to increase as can be seen in Equation (12) or (13). 
Consequently, PA allocates a smaller power P(t), so that the 
discharge event happens infrequently. 

Observation 3: As q b {t) increases, q b {t) decreases, hence, PA 
tends to allocate a larger power P(t) to avoid battery overflow 
in advance. 

From Observation 2 and Observation 3, PA expends energy 
neither too conservatively nor too aggressively. 

B. Performance Analysis 

Recall that A(t) is the available amount of sensing data and 
R(t) is the actual amount of data the transmitting node chooses 
to sense. Clearly, using the rate controller, we make sure that 
the data queue remains within a certain bound. This has a 
positive effect on the battery as well, since a certain portion 
of the data packets are not allowed into the transmitting node. 
The natural question one would ask here is, whether our rate 
controller rejects too many packets in the first place to synthet- 
ically meet the constraints. In the following theorem, we show 
that this is not the case. Let A* = limim/r->oo ^ 2^2t=o 
be the optimal objective value of Problem (A) for any given 
exogenous process {A(t), t > 0} and the replenishment 
process {r(t), t > 0}. In the following theorem, we show 
that the sensing rate achieved by RC and PA is asymptotically 
close to the optimal sensing rate A* as Bd and B b tends to 
infinity. The optimal value can be achieved arbitrarily closely 
by increasing V. We use the notation y = 0(x) to represent 
y going to 0 as i goes to 0. 

Theorem 1: If the following conditions hold: 

(1) /z(-) is concave on 5R+ 1J{0}, and its slope at 0 satisfies 2 
0 < 0 = At'(0) < oo, 

(2) 0 < r(t) < r max , for all t > 0, 

then the joint power allocation and admission control algo- 
rithm (with RC and PA) achieves: 



Q d (t) <j+A max , Vt>0 
qb(t) < 0 (^+A Bm 

Po < Vo, 

Pd < Vd, 



Vt>0 



T-l 



T-l 



liminf- Vi?(t) >liminf- Vii*(i)-0(- 



t=0 



t=0 



0) 



V 

g(v, B dl B b ), 



(15) 

(16) 

(17) 
(18) 

(19) 



where fi m . dx 
rate, and 



g(V, B d , B b ) 



M(-Ppeak) is the upper bound for the transmission 



0{ 

o( { - 

0(< 



V 

a-B d )+ + 



V > 
lV-B b ) + 



if Bd = oo, B b = oo 
if Bd = oo, B b < oo 
if Bd < oo, B b = oo 



) + 0( 



if B d < oo, B b < oo. 
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The proof of Theorem 1 can be found in Appendix B. In 
Theorem 1, V is a finite tunable approximation parameter 
that controls the efficiency of the algorithm. Observe Equa- 
tion (19), which compares the performance of our algorithm 
with that of the optimal solution of Problem (A), the term 
iloiUnax + 0) captures the influence of battery outage, and it 
is small since the battery outage threshold r\ a is usually set 
to be very small to avoid network disconnection. Function 
g(V, Bd, Bb) represents the asymptotical property of the gap. 
If both battery and the data buffers are of infinite size, 
then the performance gap is identical to 0. Otherwise the 
performance gap is dictated by the finite (or the smaller) one of 
the buffers. One can observe that, by appropriately choosing 
the ^-parameter, we can make the performance gap decay 
inversely proportional to the buffer sizes. 

IV. Multihop Network Model 



1(0 





Fig. 3. Multihop Network Model 

We consider a multihop wireless sensor network with N 
nodes and L links as illustrated in Figure 3. Each node 
n £ K = {1,2, ... ,N} is attached to power sources for 
replenishment. Let A„(t) and Rn(t) denote the amount of 
available data for sensing and the actual amount of sensed 
data, to node n that are destined to node e in slot t. We 
assume that each node n maintains an infinite data buffer 
with state q^ ,e {t) for flows destined to e, and also maintains 
a finite battery buffer with size B^ and state q r h l (t) (We focus 
on infinite data buffer and finite battery buffer in order to 
emphasize the difference from single link to multihop. For 
other combinations of data and battery buffer sizes, then 
extension can be made similarly without too much effort). 
Let r n (t) denote the replenishment at node n in time slot 
t. The transmit power is chosen to be P/(t) over link I. In 
the formulation, we assume that the power the receiving node 
consumes to receive and decode the packet is identical to 
Pi(t) as well. The sole reason for this is simplicity and the 
generalization to the asymmetric case is straightforward (By 
defining a receiving transmitting power ratio, we can extend 
it to the general case with no technique difference). We use 
the node-exclusive interference model. Under this model, a 
node can only receive from or transmit to at most one node 
at any time slot. In each time slot t, with the assigned power 
Pl(t), the achieved data rate at link I is fn(Pi(t)) in that time 
slot, where the rate function m(-) is a non-decreasing, concave 
and differentiable function satisfying /i;(0) = 0. Let p" be 
the frequency of visits to the zero battery state for node n. 



Let Q, n and 9„ denote the set of directed links originated 
from node n and terminate at node n, respectively. We say 
P = [Pi(t) ■ ■ ■ P^it)] satisfies the node-exclusive model if 
Pi(t) > 0 for some I e fi„ U 0„, then P v = 0 for all 
I' 6 (Q n U©„)\{7}.Ina multihop network, we formulate the 
joint queue and energy management problem as follows: 

1 T_1 

(B) maxliminf-]T ]T R e n (t) 

t=0 n,e£N 

s.t. P(t) satisfies the node-exclusive model, (20) 



q n d e {t+i) < ( g n*)-E tf(m))) + +K(t) 



(21) 



qb(t+l) = min 



«?(*)- E W)+r n (t),BZ 
ien n ue n 



(22) 
(23) 



0< E W < min >^peak 
N 

EtfW)) = W(^(*))' R n(t) < K(t), (24) 



e=l 



T-l 



limsup^ 9d' 6 (*) < 00 ' n ^ e ' 



T^oo 



p n 0 < v:, 



t=0 



(25) 
(26) 



where 77" is the desired upper bound for p", P(t) is the 
power assignment vector for all links in slot t, P is the power 
assignment for all links over all time slots, and R is the actual 
sensing data vector for all node-destination pairs over all time 
slots. In Problem (B), the objective is to maximize the long- 
term average total sensing rate for all nodes destined to all 
destinations. 

In Problem (B), (20) is the interference constraint. Con- 
straints (21) and (22) describe how the data and battery queues 
evolve, respectively. Note that the destination node of each 
flow does not need to maintain a data buffer for that flow, as 
indicated in (21). Constraints (23) are the energy conservation 
equations stating that we cannot oversubscribe the energy that 
is unavailable in the battery nor can we exceed the peak power 
level. Constraints (24) are the rate conservation equations that 
bound the actual amount of sensed data R^ (t) by the available 
amount of data A e n (t), and share the transmission rate of a link 
among all the destinations in slot t. Constraint (25) is the QoS 
constraint for data queue: we need to keep all the data queues 
stable. Constraint (26) is the battery QoS constraint of the 
desired battery discharge rate 77™. 

Similarly, we define virtual queues for all n E N 



n" 
lb 



(t + l) = ((qUt)-^) + + E m)-r n {t) 

ien„ue„ 

+ M n (t) + l n 0 {t)Y ', (27) 
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where M n (t) = - E; e n„ue n p i (*) + r n (t) - B%) + 

is the amount of missed replenishment and 

7™(i) = indicator that battery hits zero state in slot t 
for node n 

= | 0 if E IenBU e B (*)<??(*) 



1 otherwise 



Next, we generalize the main results and algorithms RC and 
PA to the multihop scenario. 

Corollary 1: If all the virtual battery queues q£(t), VneN 
are strongly stable, we have p" < 77™, Vn e N. 
The proof is identical to the single hop scenario and can be 
found in Appendix C. We give the algorithm in the following 
section. 

V. Joint Rate Control, Power Allocation and 
Routing Algorithm for Multihop Networks 

The joint rate control, power allocation and routing algo- 
rithm for multihop networks can either be implemented in a 
centralized or distributed manner. For the centralized solution, 
we use the classical Maximal Weighted Matching (MWM) 
based algorithm and for the distributed algorithm, we can use 
the Maximal Matching (MM) based algorithms as in [19] [20]. 

Multihop Rate Control (MRC): 
Depending on whether Maximum Weight Matching or Maxi- 
mal Matching is employed by the scheduler, there is a slight 
difference in the implementation of MRC: 
Maximum Weighted Matching (MWM): If q^' e (t) < \ , node 
n chooses to sense all the available data packets, i.e., R^if) = 
A^(t); otherwise, reject all the arrivals, i.e., = 0. 

Maximal Matching (MM): If q^' e (t) < V , node n chooses 
to sense all the available data packets, i.e., R„(t) = A^(t); 
otherwise, reject all the arrivals, i.e., Rn(t) = 0. 

Multihop Power Allocation (MPA): 
Here the goal is to ensure that no node transfers data of a 
flow to a relay node that is not the destination of that flow, 
unless the differential backlog for that flow is greater than a 
fixed value 7 > 0. We will choose the value of 7 such that the 
resulting backlog of the receiving node is not larger than that 
of the transmitting node after the transmission. This pushes 
the data flow from the source to the destination with a positive 
back pressure. Let tran(Z) and rec(Z) denote the transmitting 
and receiving node of link I, respectively. We first define 



ll 



7 if rec(Z) ^ e 
0 otherwise 



constant. 

\t) - It) 



Let ei(t) = 
be the flow 



where 7 > 0 is some 

argmaxe |q d (t) - q d 

on link I that has the maximal modified differential backlog 

and wi(t) = max [q^ Mt) (t) - qf {l)Mt \t) - 7 f W ,0 
is the nonnegative differential backlog of I at time t. 
For each link I, solve 

' -tran(i) ^ 



Pi(t) 



max - (f b s 

t)enj(t) v 



~rec(H 



(t))Pl(t) 



(28) 



where II; (t) 



mm 



qr {l) (tu: c{l) (t),p, 



{W) 

peakj }■ 



< 



< 



Let Pf(t) be the solution for link I. With the calculated 
power Pf(t), let Wi{t) = «,,(t) W (P,'(t)) - (f b m{l) (t) + 

~rec(0^ps^ be the wdght Qn Unk 1 

For the whole network, we either use the MWM or MM 
as described below. We will analyze the performance of our 
solution when using each algorithm. 

Maximum Weighted Matching Algorithm: link I has weight 
Wi(t), then the weight of a matching M is Wj^{t) = 
^2i e j^Wi(t). The network chooses a maximum weighted 
matching in a centralized manner, the links in the cho- 
sen matching become active with the calculated transmitting 
power, and other links are not activated. 

Maximal Matching Algorithm: the network calculates a max- 
imal matching that achieves at least half of the total weight 
of MWM in a fully distributed manner as in [19] [20]. The 
links in the chosen matching become active with the calculated 
transmitting power, and other links are not activated. 

Multihop Routing: 
When wi(t) > 0, transmit for flow that is destined to 
ej(t) with rate m{Pi{t)), i.e., tf l(t) (Pi(t)) = w(fl(t)) and 
tf(P l (t))=0, Ve^e ; (t). 

Note that MRC and routing can be done by each node inde- 
pendently. We then give our main theorem for the multihop 
scenario: 

Theorem 2: If 

(1) fii(-) is concave on 5R + 1J{0}, and its slope at 0 satisfies 

0 < 0 = Atj(0) < 00, V/ e L, 

(2) Vrj£N:0< r n (t) < r max , Vi > 0, 

then the maximum weighted matching based joint rate 
control MRC, power allocation MPA, and routing algorithm 
achieves: 



lT(t) <^ + A m . dX , 

«?(t)</?(r+4n=), 



(29) 
(30) 



T-l 



n ( t=0 e ) 

( T-l 

> J] Hminf ^EE - ^"(Mrnax + 0) 



.o ( <M)l,l-o(I), 



(31) 



and the maximal matching based joint rate control MRC, 
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power allocation MPA, and routing algorithm achieves: 

# e (t) <V + A m . dx , 
ql l (t)</3(V + A max ), 



(32) 
(33) 



n L t=0 e J 

n ( t=0 e 



77"(Mmax + /3) 



(34) 



The proof of Theorem 2 can be found in Appendix D. The 
results can be interpreted similarly as Theorem 1. 

VI. Numerical Example 

We consider a network topology, shown in Figure 4 (a). 
There are 6 nodes, 7 links, and 2 flows with source-destination 
pair (3, 1) and (5, 2), respectively. In all simulations, the 
simulation time is T = 10 6 time slots. We use the rate power 
function m{Pi) = 101og 2 (l + a jf L ) packets/slot VI G £. Let 
the power of the background noise Ni = 1.6 x 10 — 14 W, V/ G 
L, and the channel gains gi = 1.6 x 10~ 13 , V/ G £. Each node 
is equipped with an infinite data buffer for each flow through 
it. The number of arrivals Al(t), t > 0 and A§(i), t > 0, 
are modeled as independent Poisson random variables with 
mean A = 20 packets/slot and A m . dx = 30 packets/slot. We 
set rj", the threshold of battery outage probability to 0.03 
for all n G N and the peak power P pea k = 1.5W. The 
backlog threshold 7 = 80 > max„ je (j2 K c(i)=n Mi + ^n) = 
2 x 101og 2 (1 + lOPpeak), so that the resulting backlog of the 
receiving node is not longer than that of the transmitting node. 




destination 2 



(a) 



destination 1 



10000 15000 

T(slot) 
(b) 



Fig. 4. (a) Network topology and (b) a sample replenishing process 

Scenario 1: We first use a replenishment process which is 
formed by a periodic deterministic sine waveform (r max = 1.2 
and period 8000) plus independent Gaussian noise with zero 
mean and variance 0.01, as shown in Figure 4 (b) (The cycles 
imitate the daily solar cycles for a solar battery and the average 
replenishing can be simply calculated f = 0.2). All the battery 
buffer sizes are set to be B b = 100 J. We simulate both MWM 
based and MM based algorithms. We choose different values 
of the control coefficient V for the proposed algorithm and 



compare the results with the optimal value 3 . From Figure 5 
(a), we see that as V increases, the average total sensing rates 
of the MWM and MM based algorithm keep increasing and get 
closer to the optimum and a value that is much larger than half 
optimum, respectively. This is consistent with Equation (31) 
and Equation (34). From Figure 5 (b), we see that as V 
increases, the average data queue length (we here only plot the 
data queue length of node 3 for flow 1 due to space limitation) 
keeps increasing but is upper bounded by the bound we get 
in Equation (29) and Equation (32). This means the queueing 
delay increases as we improve the sensing rate, which can be 
viewed as a tradeoff. From Figure 5 (c) we observe that the 
battery discharge probability (we only plot for node 5 here) 
increases to the threshold as V increases. 
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Fig. 5. Performance of the MWM and MM based algorithms. Impact of the 
control parameter V on (a) the average total sensing rate, (b) average data 
queue length, and (c) the battery discharge probability for Scenario 1. Impact 
of battery size on (d) the average total sensing rate for Scenario 2. 



Scenario 2: We use different replenishment processes: r 2 (t) 
and r^(t) are i.i.d Bernoulli random variables Bernoulli(0.5) 
(i.e., r 2 (£) = 1 w.p. 0.5 and r 2 (t) = 0 w.p. 0.5); replen- 
ishing at all other nodes are independent Bernoulli random 
variables 0.2 x Bernoulli(0.5) in even number slots, and 
0.6 x Bernoulli(0.5) in odd number slots (f 2 = f 5 = 0.5 and 
f = 0.2 for other nodes), all plus Gaussian noise with zero 
mean and variance 0.01. This replenishing process is faster 
time-varying than the one in Scenario 1. We simulate 3 three 

3 The exact optimal objective value for Problem (B) is hard to obtain, 
so we here use an upper bound for the optimum. For this example, an upper 
bound for the optimum can be obtained by equal time sharing of schedules 
{1, 4, 7} and {1, 5, 6}, and utilizing the link rate fj,(f ) under infinite battery 
size and no discharge constraint. In Scenario 1, the optimum is 2fi(f) = 30. 
In Scenario 2, since fi(f2) = /u(^b) > A, the optimum is A + (J.(f3) = 35. 
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different battery sizes = 100J, Bf, = 10J and B^ = 1J 
(all nodes has the same battery sizes) for the MWM based 
algorithm. From Figure. 5 (d), we can see that the sensing 
rate increases as battery size increases. However, as long as 
the battery is large compared to the average replenishing rate, 
the improvement diminishes with increasing battery sizes. 

Scenario 3: We simulate two tree topologies as shown in 
Figure 6 (a) and 6 (b). For both topologies, the replenishment 
processes for nodes 1,2,3, i.e., ri(t),r2(t) and 7-3 (i) are 
i.i.d Bernoulli random variables Bernoulli(0.5); replenishment 
at all other nodes are as shown in Figure 4 (b). Further, 
Gaussian noise with zero mean and variance 0.01 are added 
to all replenishment process. The battery size for all nodes are 
Bb = 200 J. Data arrives at leaf nodes 4, 5, 6, destined to root 
node 1. The arrival processes A\(t), t > 0 and A\(t), t > 0, 
are independent truncated Poisson with mean A4 = A5 = 5 
packets/slot and A max = 20 packets/slot; and the arrival 
process A\{t), t > 0 is an independent truncated Poisson 
process with mean Xq = 10 packets/slot and A max = 30 
packets/slot. From Figure 6 (c) and (d), we can see that the 
MWM based algorithm perform close to optimum and MM 
based algorithms perform much better than half the optimal 
performance 4 . Note that the MM algorithm for Figure 6 (a) 
achieves almost 100% of the optimum and MM algorithm for 
the topology in Figure 6 (b) achieves 80%, which is still higher 
than the performance (75%) for topology in Figure 4 (a) as 
shown in Figure 5 (a). The reasons are as follows: The number 
of schedules for the topology in Figure 6 (b) is more than that 
of Figure 6 (a); the tree topologies have fixed routing for each 
data flow, but for a general multihop topology such as the one 
in Figure 4 (a), there are multiple routing possibilities for each 
data flow. 

VII. Conclusion 

In this paper, we studied the problem of energy management 
in rechargeable wireless sensor networks. Our objective was 
to maximize the average data sensing rate subject to QoS 
constraints on both data and battery queues. We provided a 
simple and unified framework of joint rate control and power 
allocation for all combinations of finite and infinite data and 
battery buffer sizes. We showed through both analysis and 
simulation that the performance of our strategy is close to 
that of the optimal solution. We extended our algorithm to 
the multihop scenario and showed that simple extensions of 
our index schemes for single hop generalize to the multihop 
scenario with a similar, close-to-optimal performance. We 
developed a distributed joint rate control, power allocation 

4 Note that the average replenishing rate of nodes 1,2,3 are twice of 
the average replenishing rate of nodes 4, 5, 6, so the replenishment processes 
of 4, 5, 6 determines the upper bound of the optimum of total sensing rate. 
Furthermore, all leaf nodes are time sharing the transmission opportunity, then 
for topology in Figure 6 (a). Consequently, the upper bound of optimal total 
average sensing rate is 10, since A4 + A5 = 10 < f.(r) = 15, where f is 
the average replenishment rate in Figure 4 (b). Similarly, for the topology in 
Figure 6 (b), the upper bound of optimal total average sensing rate is 15 since 
A 4 + A 5 + A 6 = 20 > n(r) = 15. 




Fig. 6. Performance of the MWM and MM based algorithms for tree topology 
networks. Impact of the control parameter V on (c) and (d) the average total 
sensing rate for topology (a) and (b), respectively. 

and routing algorithm for multihop networks under node- 
exclusive interference model. Although we do not consider 
channel variations in this paper, taking channel variations 
into consideration is technically straightforward. Our multihop 
network formulation does not consider the fairness among 
links, however, the opportunistic scheduling framework [21] 
used to resolve the fairness issue can be applied in this context. 
For future work, we are extending this framework to the 
general interference model where the rate power function is 
not concave. 
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Appendix 
A. Proof of Proposition 1 

Using the idea similar to [17], we have the fact that 
if any queue represented with Q{t) is strongly stable, 
then limsupy^oo ^rp- — 0. Hence, if q d (t), qb(t) and 
qb(t) are strongly stable, then limsup-j^^ < ^ T - ) = 0, 
limsupj,^^ Si^l = 0 and limsupy^^ 2*4p- = 0. 

From Equation (11), we have q d (t + 1) > qd{t) — VdR(t) + 
D(t) - D(t) - fi(P(t)) + R(t) + I(t). Note that q d (t + 1) = 
q d (t) - fi(P(t)) + I(t) + R(t) - D(t). By adding from 0 to 
T — 1, dividing by T and taking limsupy^^ on both sides, 
we have 



Km sup 

T^oo 



> lim 



M0) q d (T) q d (0) 

T T^oo T 



T-l 



lim sup lj] \D(t) 



VdR(t) 



t=o 



Since lim sup T 
and lim-r 



qd(T)-q d (0) 
T 



o, 



lim<r_ 
0, 



o <p 
SO 



we 



0, 
get 



limsup^^ ^ELo 1 !- 0 ^) _ VdR(t)} < 0. Thus, 

fEw[^)-i^)] 



Pd - Vd 



■■ lim sup 



= lim sup 



-Vd 



<0, 



i.e., Vd < Vd- 

Similarly, from Equation (12), we have q b (t + l) > q b (t) — 
7] 0 + P(t)-r(t) + M(t) + l 0 (t). Note that q b (t + 1 ) = q b (t) - 
P(t) + r(t) - M(t). By adding from 0 to T - 1, dividing by 
T and taking limsupy^^ on both sides, we have 



lim sup — — > hm — ?? 0 



T 

qb (0) 



1 T_1 
lim sup - V l 0 (t) 



Since limsupy^^ _ 
and limr^ 



+ lim 

T^o< 

Qb(T) = q 



Qb(T) 



T 

limy 
0, so we 



q b (Q) 

OO rp 



0, 



get p 0 = 



limsup^^ ^J2t=o 7 °(*) ^ Vo- ■ 
B. Proof of Theorem 1 

Proof of Equation (15): Note that I(t) < (i(P(t)) and 

R(t) < A mux . The rate allocation unit RC is chosen to satisfy 
Equation (15). 

Proof of Equation (16): Since /i(-) is concave on 5R+ 1J{0}, 
we have fi(P(t)) < fj,(0) + 0P(t) for P(t) G Vt > 0, 

where 0 < (5 = /Lt'(0) < oo. Then, Q d {t)fi{P{t)) - 
q b (t)P(t) < Q d (t)fi(0)+PQ d (t)P(t)-q b (t)P(t) where P(t) 
is the solution of PA. 

If PQ d (t)P(t)-q b (t)P(t) < 0, thenwegetQ d (t) M (P(t))- 
q b (t)P(t) < Q d (t)fj,(0). However, PA chooses P(t) that maxi- 
mizes Q d (t)fx(P(t))-q b (t)P(t) which means Q d (t)fi(P(t))- 
q b (t)P(t) > Q d (t)fi(0) since 0 G n(f). Then we must have 
PQd(t)P(t) - q b (t)P(t) > 0, i.e., f3Q d {t)P(t) > q h {t)P{t). 

We now prove Equation (16) by induction. Without loss of 
generality, let q b (0) < + A max ). Suppose for all t > 1, 
q b (t - 1) < + A m . dx ) holds. In slot t, if P(t) > 0, then 
Qb{t) < f3Q d (t) < (3{\ + A max ). Otherwise, P(t) = 0, from 
Equation (10) and the assumption r(t) > 0, Vt > 0, we have 
l Q {t) = 0. We also note that M(t) < r(t), then q b (t) < 
q b (t - 1) < + A max ). Therefore, q b (t) < j3{\ + A max ) 
for all t, which is Equation (16). 

Proof of Equation (17) and Equation (18): If p a = 0, we 

are done. In the following analysis, we assume p Q > 0. 

From Equation (16), we already have limsupy^^ = 
0. In order to apply Proposition 1, we only need to show 
limsupy^^ qb !jT^ = 0. 

Claim: For any t > 0, there exists t'(t) > t such that 
qb(t' + 1) = r(i') and t' — t < oo. Suppose there exists 
t > 0 such that for any t' > t with q b (t' + 1) = r(t'), 
t' — t = oo. This means that 3N < oo such that q b (t + 1) > 
r(t), Vi > N, i.e., l 0 (t) = 0, Vi > N. Thus, Po = 
limsup^^ i J2l=o ] o(t) = limsup-r^ ^ J2^=o = 
0, which leads to a contradiction. 
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Since r(t) < r max < oo, Vt > 0, and q b (t) — q b (t + 1) < 
P(t) < -Ppeak- Combining with the above claim, we have 
q b (t) < r max + (t'(t) - t)P peak < oo, Vt > 0. Therefore, 
liirisupj,^ = o. 

Equation (17) then follows from Proposition 1. 

Equation (18) directly follows from Equation (15) Proposi- 
tion 1. 

Proof of Equation (19): 

I) Bd = oo and B b = oo. 

We define the Lyapunov function L{q d {t),q b {t)) = q d (t) + 
q 2 {t), and A(q d (t),q b (t)) = L(q d {t + l),q b (t + 1)) - 
L(qd(t),q~b(t)), where qd(t) and q b (t) are the resulting queue 
dynamics of our algorithm given a sample path. Note that 
our problem is a sample path optimization problem, so 
L(qd(t),q~b(t)) is defined without conditional expectation. 

From Equation (13), we have q 2 (t + 1) < (q b (t) — i] 0 ) 2 + 
(l 0 (t) + P(t) - r(t)) 2 + 2(q b (t) - V o) + (lo(t) + P{t) - r(t)). 
Also from the data queue dynamics, we have q^(t + 1) < 
qj(t) + ^ 2 {P(t)) + R 2 (t) + 2q d (t)R(t) - 2q d (t)»(P(t)), then 

A=A(q d (t),q b (t)) 

<^ 2 (P(t)) + R 2 {t) + (r max + Vo ) 2 - 2q d {t)fi{P(t))+ 



(1 + P f 



peak ) 



2q d (t)R(t) + 2q b (t)(l 0 {t) + P(t) - r(t)) 



<(1 + Ppeak) 2 + (r max + Vo) 2 + /"max + + VR(t) + 

2q b (t)(l 0 (t) - r(t)) + 2[q d (t) - V/2]R(t)- 
2[q d (t)Li(P(t))-q b (t)P(t)]. 



It is apparent that RC is trying to minimize the term 
[qd(t) — V/2] R(t), and PA is trying to maximize the value 
of the term [q d (t)fi(P(t)) - q b (t)P(t)]. Since the optimal 
solution for Problem (A) may not be unique, we let CP* be the 
optimal solution set and P* G V* be any optimal solution, for 
Problem (A) given any sample path. Since the constraint set 
IT(t) is queue dynamic related, it is possible that P*(t) H(t). 

Lemma 1: If by solving Equation (14), we get 
[Q d {t)ii{P{t)) - q b {t)P{t)] < [Q d {t)ii(P*{t)) - 
q b (t)P*(t)], then P(t) = q b {t) and l a (t) = 1. 
Proof: In time slot t, let P m (t) be the value that maximizes the 
unconstrained objective function Q d (t)n(P(t)^ — q b {t)P(t). 
Claim P. q b (t) < P peak . Otherwise, U(t) = [0, P peak ] 
which is not queue dynamic related, then Q d (t)/i[P(t)) — 
q b (t)P(t)]>[Q d (t)n(P*(t))-q b (t)P*(t\. 
Claim 2: P m (t),P*(t) > q b (t). If P m (t),P*(t) £ U(t), we 
must have [Q d (t)fi(P(t)) - q b (t)P(t)] > [Q d (t)fi(P*(t)) - 
q b (t)P*(t)], then P*(t) > q b (t). If P m {t) < 0, we 
will have Q d (t)A*(0) = [Q d (t)»(P(t)) - q b (t)P(t)] > 
[Q d (t)n(P*(t)) - q b (t)P*(t)], then P m (t) > q b (t). 

By the above claims, P(t) < P*(t) and P(t) < P rn (t). 
Suppose P(t) < q b (t), since the objective function of PA is 
concave in P(t) and P m {t) > q b (t), we can increase P(t) to 
increase the objective without violating the constraint. Thus, 
P(t) = q b {t) and l D (t) = 1. ■ 



By Lemma 1, we have 



A <(1 + P peak ) 2 + (r max + Vo ) 2 + ^ + A 2 m . dX + VR(t)- 

VR*(t) + (V + 2A max )(/3 + Mmax)io(t) + 

2q d (t)[R*(t) - »(P*(t))] + 2q b (t) [P*(t) - r(t)] . 



(35) 



The following lemma gives the sufficient condition for 
strong stability under sample mean argument. 

Lemma 2: For any queue with evolution q(t + 1) = (q(t) — 
fi(t)) + + X(t), if q(Q) < oo, /j,(t) < [i m . dx , Vf > 0 and 
limsupy^^ y >~2j=o (A(*) — < 0' men i s strongly 
stable, i.e., limsupy^^ ^ >~2t=o <?(*) < °°- 
Proof: Without loss of generality, assume q(0) = 0. Let 
T 0 = 0, Ti = min|t > 0 : q(t) < T 2 = min|t > 

Ti : q(t) < M*)}-., T k = minjt > T k ^ : < 

and so on. Let N k = T k - T fe _i, Vfc > 1. 
We first claim N k < oo, Vfc > 1. Suppose there ex- 
its 1 < k < oo such that N k = oo, then it means 
ELt^! ( A (*) - + !)) > 0, VT > T fe _i, and as a result, 
liminfT-,00 ^ Et=r fc _ 1 (^(*) _ + 1)) ^ 0- However, 



E (A(t)-M* + 1)) 

= E( a (*)-m*))- E 



t=0 



t=0 



+ /x(T fc _ 1 )-/x(T) 

/ * In each of the fc segaments, the sum of the arrival 
departure difference becomes negative once, and the 
negative drift is no smaller than — ^ max * / 

T-1 

< ( A W - M*)) + fc Mmax + /i(Tfc_i) - H(T), 

t=0 



then, 



T-1 



T-1 



E (A(t)-Mt + i))<E( A (*)-M*)) 

t=T fc _! t=0 

+ fcM max + i u(Pfe-i) -A»(T). 



By dividing by T and taking limsupy^^ of both sides, 
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we have 



T-l 



liminfi £ (A(i)-M* + 1)) 

t=T fc _i 

1 T_1 

< lim sup- ^ (A(i)-M* + 1)) 



T^oo 



T-l 



<limsup^^(A(t)-Mt)) + 
t=o 

fc^max + /i(Tfc_l) - /i(T) 



T^oo 



lim sup • 

T^oo 



T 



<0, 



which is a contradiction. 

Let N — maxt>i JVj, then N < oo. This im- 
plies < Nfi m . dX} Vt > 0. Now we suppose 
limsupy^^ y Y^t=o = 00 • This means there exists 
a subsequence of times {t„} such that t„ — ► oo and 

*7 Efco 1 -> °°- In other words ' VQ > 0, 3 T(Q) e 
{t n } such that Mtk E {t n } and tk > T(Q), we have 
Y^tLt) 1 Q(t) > Q- Let Q — N^i m . dx , and find the corre- 



such that 



E^"- 0-1 «(*) > 



sponding T(Nfi maxJ ...^ T(A>miix ) 

N/i mdx . Then there must exists 0 < T < T(N/i mdx ) with 
<z(T) > N^L max which leads to a contradiction. Therefore, 
limsupj,^ i ELo 1 < oo- ■ 
Lemma 3: 

T—l 

llim Sup i^ g - b (t)[P*(t)-r(0] <0(i). 



T^oo 



t=0 



Proof: Note that 

T-l 



T-l 



t=0 



£P*(t)- <fc(0) + $>(*) 



t=o 



< 0, 



then 



T-l 



J2(P*(t)-r(t)-e)<q b (0), 
t=o 

where e > 0 can be arbitrarily small. By divided by T and 
taking limsupj^^ of both sides, we have 

1 T ~ 1 

Iimsup-2(p*(t)-r(t)-e) < 0. 

Construct an auxiliary queue with the following evolution: 

q* b (t + l)=(qm-r(t)-e) + + P*(t), 

where e > 0 can be arbitrarily small. By Lemma 2, g^(t) 
is strongly stable. By multiplying q b (t) f° r both sides of the 
inequality q£(t+l) > cfi(t)—r(t) — e + P*(t) and rearranging 

terms, we obtain q b (t) [P*(t) - r(t)] < q b {t)\q* b {t + 1) - 



q b (t) + e . By summing from 0 to T — 1, dividing by T and 
taking limsupy^^, we have 



T-l 



Iltosupij>(t)[P*(t)-r(i)] 

V 7 T^oo J — 



t=0 



1 1 

<- lim sup- Vg£(t)(&(t-1) -&(*)) + 

V T-.00 J . - 



1 ,. g6(T)gg(r)-g 6 (0)gg(0) if #) 

— hm sup + hm sup - > — - e 



t=l 



1 1 V 7 

<y + W) lim sup - + -y^~2 + Am ^ 



=o(^) + (- + -^) e , 



since the average queue length of the auxiliary queue remains 
finite no matter how large the system parameter B b becomes 
and is not related to the algorithmic parameters V. By letting 
e — > 0, we finish the proof. ■ 

Note that q* d (t + 1) = (q* d (t) - /i(P*(t)))+ + R*(t) > 
9d( < )-/ i (P*( t )) + - R *( i )- B Y multiplying both sides with q d (t) 
and rearranging terms, we obtain qd(t)(R* (t) — fi(P*(t))) < 
q d (t)(q* l (t+l)-q2(t)). With the optimal policy, q* d is strongly 
stable. By summing from 0 to T— 1, dividing by T and taking 
limsupy^^, we have 

1 T_1 

limsup-J2q d (t)[R*(t)-»(P*(t))] 



<limsup^M(^^MW + 



T^oo 



1 T 

lim sup- y2q d (t)(q d (t - 1) - %(*)) 

T^oo -t t=1 

1 T 

< fi mdx lim sup — ^2 Id (t ) , 



(36) 



T^oo 



t=l 



which is a finite constant that is not related to V. 

By summing from 0 to T — 1, dividing by T and V, taking 
limsupy^^ over Equation (35), combined with Lemma 3, 
and Equation (36), we get 



T-l 



T-l 



lrniinf i V J2(t) > liminf ± V iT(t) - r/ 0 ( Mmax + /?) 



t=0 



T^oo T 



II) B d = oo and B b < oo. 

From Equation (12), we have g^(t + 1) < (q b (t) — rj 0 ) 2 + 
(l 0 (t) + P(t)-r(t) + M(t)) 2 +2(q b (t)- Vo ) + (l 0 (t) + P(t) - 
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r(t) + M(t)), then 

A=A(q d (t),q b (t)) 

<(1 + -Ppeak) 2 + (Vo + rmax) 2 + /« 2 (P(i)) + + V R(t) 

+ 2[q d (t) - V/2]R(t) -2[q d (tMP(t)) - q b (t)P(t)] 
+ 2q b (t)(l 0 (t)-r(t)+M(t)) 

<(1 + -fpeak 

+ - «*(*)) + 2g d (t)[ J R*(«) - /x(P*(t))] + 2&(i) 

[P*(t) - r(t) + M*(t)] + (V + 2A m . dX )([3 + » max )l 0 (t) 

Lemma 4: 



T-l 



llMsup i£&(t)M(t) < 0( (2^^) + }- 



t=0 



Proof: We first provide a rough idea of the proof: by exploring 
the relations between q b if) and qb(t), we notice that as qb(t) 
increases from 0 to B b , q b (t) will decrease below (| (V" + 
Anax) — Bb) + at some slot. We now give the proof details. 

Without loss of generality, let q b (0) = 0. We have the 
following cases: 

i) if P(t) > r(t), l 0 (t) = 0 and q b (t) > 0, then M(t) = 0, 
q b (t + 1) < q b (t) and q b (t + 1) - q b (t) < q b (t) - q b (t + 1), 
i.e., even if q b (t) increases, the increment is no larger than the 
decrement of q b (t); 

ii) if P(t) < r(t), l 0 (t) = 0, then if q b (t + 1) > 0, 
q b {t) - q b (t + 1) = r(t) - P(t) - M(t) -T] 0 + { Vo - 
qb(t)) + > q b (t + 1) - q b (t) = r(t) - P(t) - M(t), i.e., the 
decrement of q b (t) is no less than the increment of q b (t), else 
if q b (t + 1) = 0, it goes to case iv); 

iii) if l 0 (t) = 1, then q b (t + 1) = min[r(t), B b ] by definition 
of discharging event Equation (10). If M(t) > 0, then this 
means r{t) > B b and -r(t) + M(t) = —B b \ 

iv) if q b (t) = 0, then q b (t)M(t) = 0. 

Combine the above discussion with Equation (16), we have 
that Vi > 0, if M(t) > 0 and q b (t) > 0, we must have 

^4-max) + m^^maxj 1] — B b . Thus, 

T-l 



Ilimsupi^9 6 (i)M(i) 



t=o 



< 



+ A miix ) + max[r max , 1] - B b ) + r n 



V 



(IV-B b ) + 
V 



The remaining argument is similar to case I), and we obtain 

^ T-l T-l 

liminf - V J2(t) > liminf - V - r/ 0 ( Mmax + /?) 



t=o 



t=0 



<> 



)• 



III) B d < oo and Bb = oo. 

From Equation (13), we have q^(t + 1) < (^(i) - ry 0 ) 2 + 
(7 0 (t)+P(t)-r(t)) 2 +2(%(t)-r ?0 )+(7 0 (t)+P(t)-r(t)). Also 
from Equation (11), we have <^(i + 1) < (q d (t) - f] d R(t)) 2 + 



(I(t) + R(t) - n(P(t))) 2 + m(t) - VdR(t))+(I(t) + R(t) - 
H(P(t))), then 

A=A(q d (t),q b (t)) 

<(l + Ppeak) 2 + (??o + (P(t)) + (l+ V 2 d )R 2 (t) 

+ VR{t) + 2q b {t)(l 0 {t) - r(t)) + 2q d {t)I{t) + 2 

;i - Vd)q d (t) - ^ J2(t) - 2[« d (t) Ai (P(t)) - q b (t)P(t) 

<(1 + Ppeak) 2 + (??o + r max ) 2 + M 2 ax + (1 + ^Mmax 
+ 2q d {t)I{t) + (V + 2A max )(/3 + Mmax)io(t) + Vfl(t) 

- VR*(t) + 2q d (t)[R*{t) - /i(P*(t)) + J*(t) - D*(t)] 
+ 2&(i)[D*(t) - %i2*(t)] + 2q b (t)[P*(t) - r(t)] 

Without loss of generality, let q d (0) = B d . Similar to the 
proof of Lemma 4, we have the following cases: 

i) if D(t) > 0, then from q d (t + 1) = (q d {t) - /i(P(t)))+ + 
R(t) - D(t) and I(t) = (fJ,(P(t)) - q d (t))+, we know that 
R(t) > B d when I(t) is strictly positive. Further, whenever 
D(t) > 0, q d (t + l) = B d ; 

ii) if D{t) = 0, from q d (t + 1) = q d (t) - n(P(t)) + J(t) + 

- D(t) and q d {t + 1) = ((%(*) - r) d R{t))+ + D(t) - 
H(P(t)) + I(t) + R(t) - D(t))+, q d (t) decreases no slower 
and increases no faster than q d (t) until q d (t) hits zero. 

Note that only when q d (t) < fi mnx , I(t) may be strictly 
positive. Then combine with the above discussion, we have 



-^limsup^ q d (t)I(t) 



T-l 



V 



T 



< 



B d )+fi n 



t=o 



--0( [ -±- 



B d y 



V 



■)• 



V 

Similar to Lemma 3, we can obtain 

T— 1 

1 limsup 1 ]T q d (t)[D*(t) V d R*(t)} < O(^). 



T^oo 



t=0 



The remaining argument is similar to case I), and we obtain 

T-l , T-l 



liminf i ^ i2(t) > liminf i ^ - r, 0 (jM 



1=0 



T^oo T 



t=0 



o^)-o^- Bd)+ 



V 



)• 



IV) B d < oo and Bb < oo. 

Simply by combing case II) and case III), we obtain 



T-l 



T-l 



t=0 



liminf 1 ^ J2(t) > liminf 1 V - Vo (fi m . dx + (3) 

-)- 



t=0 



0( I)- 0( (i"- B ') + 



V 



)- 
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C. Proof of Corollary 1 

Similar to the proof of Proposition 1, we have the fact 
if q£(t) is strongly stable, then limsupy^^ = 0. 

From Equation (27), we have qg(t + 1) > qg(i) - n™ + 
E ie n„u e„ P^) - '»(*) + M n (t) + l n Q {t). Note that (i + 
1) = «?(*)- Eien BU e B fl(t) + r„(t)-M„(t). By summing 
from 0 to T — 1, dividing by T and taking limsup of both 
sides, we have 



lim sup - 2 — — 



1 

- T^o ^ _ + *T SUP T £ 7 " W 



9?(0) 



t=0 



+ lim 

do r r 

gT(Q)-C(T) 



«?(0)-«f(T) 



Since limsup^^ = 0, Hindoo ^ = 0, 

and limy^oo ^W~ g <- (T) = 0, so we get p£ = 
lim sup^ 1 J2fJ l n Q {t)< rR , Vn G N. ■ 

D. Proof of Theorem 2 

Proof of Equation (29) and Equation (32): We prove Equa- 
tion (29) by induction. Let g™ ax (t) be the maximum data queue 
length for all flows at slot t. Assume that g™™^) < \ + A max 
(holds for t = 0 by letting g£' e (0) = 0, Vn, e G H), need 
to show that it holds at slot t + 1. Consider the data queue 
Qd' e (t + 1) maintained at any node n for flow destined to any 
node e^nat slot t + 1. If node n received data destined to e 
from other node m at slot t, then by the routing policy in Sec- 
tion V and definition of w {m , n) (t), q™' e (i) - q d ' e (t) > rf mn) , 
where (m, n) is the link from node m to node n. Choose 7 
such that the resulting backlog of the receiving node is not 
longer than that of the transmitting node (let n™. dx to be the 
maximum endogenous arrivals, then 7 = ^ max + A max satisfy 
this condition), we then have q d ' e (t + 1) < q d ' e (t) + 7, then 

# e (t + 1) < C 6 (*) < lT X (*) < T + Anax. If node n 
did not receive any data destined to e from other nodes, then 
it can only have exogenous arrivals. Clearly q^ ,e {t + 1) < 
q d ' e (t) < y + A max if there were no exogenous arrivals. 
If there were exogenous arrivals, by MRC of Section V, we 
must have q d ' e (t) < \, then q%' e (t + 1) < \ + A max . Thus, 
Equation (29) holds. Equation (32) can be shown using the 
same argument. 

Proof of Equation (30) and Equation (33): Consider any 

link I*. Since W .(fl.(t)) < W .(0) +/?/}.(*) for *M*) e 
!!«.(*), Vt > 0, we have 



t«,.(t)w.(fl.(t)) - (C (r) W + C (n )*M*) 



means tu,.(*)w(iM*)) " (C^W + C^W*) > 
wi*(t)ni*(0) since 0 G H/* (i). Then we must have 

(qT {l *\t)+C {n ) P l'(t) < Pn.(t)Pi*(t)- (37) 



We now prove Equation (30) by induction. Without loss of 
generality, let q%(0) < /3(^ + A max ), Vn G N. Suppose for 
all t > 1, qg(t - 1) < /3( y + A max ), VneN holds. In slot t, 
if all the incident links of any node n is assigned zero power, 
then qg(t) < qg(t-l) < /3(y+A max ). Otherwise, let I* be an 
incident link assigned non-zero power, and by Equation (37), 
we have g£(t) < /3 Wl .(t) < I3q d m . dx {t) < 0(% + A max ), Vn G 
N. Equation (33) can be shown using the same argument. 
Proof of Equation (31): Define L(q d (t),qb(t)) = 

S^H*)) 8 + £»(«?(*)) 

A(i) =L(q d {t + l)J b (t + l)) -L(q d (t),q b (t)) 

< P™x + A*max) 2 + "max] + 2 [(1 + P P-k) 2 + 

n.e n 

+ r max ) 2 ] + 2 2 q n d e (t)K(t) + 2 ]T #(*) 



where P;» (t) is the solution of Equation (28). If 



~rec(Z*)N 



then^ we get tw,.(t) W .(fl.(t)) - (g^^'^t) + 
C° (r) )- F M*) < w;»(i)w*(0). However, solution 
of Equation (28) chooses Pi*(t) that maximizes 

«;i.(t)w(iM*)) - (C (r) W + C C(r) )^-(*) whi ^ 



i; l (t)-r„(t) + M„(t)] -2^[(^ 



tran(0,ei(t) 



Note that if g^'J^'W^) - g~W> e 'W(t) < 7i ei(t) , 
then = 0 and m(Pi(t)) = 0, then 

(«T ( ' ),e,(t) (*) - i7 {l)Mt \t)hi(W)) - Mt)MPim 

otherwise, ^an(0,e i( t) (f) _ ^(O.eiW^) > u , l(t ) i then 

( £m(U.e.(t) (t) _ g-('),e J (*) W)w( p Kt)) > Wl (t) m{ P l{ t)). 

We then have 

max + A*max) + A'max ] [(l+^peak) 2 + 

n.e n 

+ ^x) 2 ] + 2^ [i n o(t) r n (t) + M n (t)] 

n 

+ 2£# e (W(*)- 



(«r (,) (*) 



Note that Eigt^lWw^W) - w 6 

-rec(0/.-,\ ps/+M \ r.„. ,,. / ps _ /-^ranCO 



; (t))Pf(t)] > EteM-w^Ww^W) - (C (0 (*) + 

g^ ec ^^))P ; s (t)], where M(t) is the matching chosen by 
MWM algorithm and M*(t) is the matching picked by 
optimal policy in slot t, and Pf(t), VI G £ are the 
suppositionally calculated power in MPA. Since the objective 
function of power allocation component is separable over 
links and for each link it is concave, using similar argument 
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as in Lemma 1, we have 

A(t) < E [( A max + A*max) 2 + Mmax] 



of the transmitting node. We then have 

5>n*x?r*(* +i)-# e *(*)) 



n,e 

(j] 0 H~ ^max 



) 2 + (V^ + 2A max )(/3 + Mmax )^(t)] + „, e ,,x 
r t/1 -Z^^ W 



q d (*)- ¥ 



i^*(t) - Dj> e *(t) + Dsn*) + E 



(ee„ 



E (flT w,e (*) - 9Sr ( ' ),e (*) - 7K(ir(*)) + 2E 

E W) "»>,(*) +K(t)+M n (i) 
jefi„U e„ 

max max 

] +E t( y+ 

n,e n 

^ max ) 

+^E - W)) +2E«TP»W+ 



E /WW) 



> 



E «-"(*) 



K*W+E^(^W)-E^(^W) 



2 E «?"(*) 

n,e 

+ 2E* n (*) 



w)+ E-E pf( p *(*)) 

\zee„ zen„/ 



E Pr(t)-r n (t) + M*(t) 



(38) 



Similar to Lemma 3, we have 

T— 1 i- 

ilimsupiEE^W E w) 

L ien„|j e„ 



T^oo 



t=0 n 



r n (i) + M*(i) 



<0(i). 



Similar to Lemma 4, we have 

1 T_1 
„.n inf — 

V T^oo T 



1 wiEE«(»^E°(^M^)- 

£=0 n n 



v 



Note that the data queue dynamics can be written as 

q n d e (t + 1) =g^ e (t) + K(t) + £ W)) - E 

/if(fl(*))-i?r(t)+«w, 

where Z)™' e (t) is the amount of overestimated endogenous 
arrivals to the queue q^' e since Eiee Mfv-^K*)) mav be 
larger than the actual endogenous arrivals; -D™J t e (i) is the 
amount of overestimated departures since Eien fj,f(Pi(i)) 
may be larger than the actual departures. Further, at slot t, 
if node n does not have endogenous arrivals for flow e, then 
D^ e (t) = 0; if flow e is transferred from node n to node m, 
then £>££(*) = £C' e (*)- B Y choosing 7 = A max + ^ max , the 
resulting backlog of the receiving node is no longer than that 



Since q d ' e (t) — q^ e (t + 1) < Umax, Vi > 0, Vn, e G N and 
<7^' e * are strongly stable Vn, e e K, then 

lim j u p ^ E E ?r (*)(«?■"*(* + 1) - #■*(*)) 



< 



t—0 n,e 



lim 

00 



qr{T)gY\T)- q y{Q) q r\Q) 



+ 



Ai max lim sup = E 9 d ' 6 * (*) 



t=l 
T 



< E ^max lim SUp ^ E 9 d ^ ' 



(39) 



t=l 



which is a finite constant that is not related to V. Thus, 

we have £ limsup^^ 1 E^To* E„, e ?2' e (*)[*£*(*) + 

Eice.fi^W) - £ IenB /W(*))] < 0(f)- ^ 
summing from 0 to T — 1, dividing by T and V, taking 

liminfr^oo for Equation (38), we get 

e{w?eeW 

n I t=0 e J 

r T-l 

> ]T ll T minf -EE - ^"(Mmax + /?) 

n I t=0 e 



f£y - B n )+ 



) -o( v ). 



which is Equation (31). 

Proof of Equation (34): Define L(q d (t),qb{t)) = 
By MRC, we have for any n, e <E K, n ^ e that 

K' e w -v]/£(t) = [ g rw-^]<(*) < [?n*)- 

V]R* n *(t) < \[q n d e {t) - V]Rl*{t) if q n d e {t) < V; and 

[ q y{t) v] K{t) = 0 < \ [ q y{t) - k] K*{t) if ?r W > 

y. Thus, 

C e (t) v] Kit) < \ [ q y(t) - v] K*(t). 

From the efficiency of the distributed matching algorithm 
[19] [20], we have EietNWw^W) - («6 (*) + 

> iE Ie M( t) [«'i(*)w(^(t))-(«r (0 (*)+ 

C C(0 (*))^ S W] > lE« e M.(t)N(*)w(i , I a W)-(«r (,) (*) + 
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-rec(i) ^ where M ^ is the matching c hosen by MWM 

algorithm and M* (t) is the matching picked by optimal policy 
in slot t, and Pf (t) , VI E L are the suppositionally calculated 
power in MP A. 
We then have 

A(t) =i(«k(t+l),ft(* + l)) -£(gk(t),ft(t)) 

max + 7 + Mmax)^ max 
n.e n 

2A max )(/3 + Mmax)^(t) + (1 + Ppeak) 2 + (Vo + W)'] 

+ vj2K(t) + \ E [# e (*) - 



E P i*(*)-rn(*) + M*(t) 



5E«r(*)f(E-E)^(*)) 
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The remaining argument is similar to the proof of Equa- 
tion (31). ■ 
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